🎮 Reinforcement Learning
Q-Learning, Policy Gradient, RL Agents, Game AI
Scoured 3775 posts in 115.7 ms
Show HN: Fighting the War Against Expensive Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app · 12h · Discuss: Hacker News
🤖 AI
Recursive self-improvement from AI models
marginalrevolution.com · 2d · Discuss: Hacker News
🤖 AI
Formalization and inevitability of the Pareto principle
arxiv.org · 14h · Discuss: Hacker News
🤖 AI
Part 2 - AI Chat Evaluation of the Formal Language in He Xin's PEPC System
news.ycombinator.com · 1d · Discuss: Hacker News
🤖 AI
Your AI Strategy Has a Human-Shaped Hole
superiortech.io · 5h · Discuss: Hacker News
🤖 AI
Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io · 1d · Discuss: Hacker News
🤖 Machine Learning
ashworks1706/rlhf-from-scratch: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and its applications in Large Language Models from scratch.
github.com · 2d · Discuss: Hacker News
🔥 PyTorch
For real game-theoretic reasoning, we need best response in imperfect information games
weyxie.bearblog.dev · 3d · Discuss: Hacker News
🤖 AI
Show HN: A minimal online decision maker
decisionmaker.online · 1d · Discuss: Hacker News
🤖 AI
Thanks to AI, we can play a Roman game again
maastrichtuniversity.nl · 12h · Discuss: Hacker News
🤖 AI
Exploring Chess Positions and Counts
win-vector.com · 23h · Discuss: Hacker News
🤖 AI
Training A Small Language Model To Outperform Frontier Models On CRM-Arena
neurometric.substack.com · 7h · Discuss: Substack
🤖 Machine Learning
Robots That Can See Around Corners Using Radio Signals and AI
seas.upenn.edu · 1d · Discuss: Hacker News
🤖 AI
Bringing a jewel-encrusted warhammer to a knife fight
reorchestrate.com · 22h · Discuss: Hacker News, r/rust
⚙ Mechanical Engineering
Outcome Engineering
o16g.com · 1d · Discuss: Hacker News
🔧 FEA
MySQL with extensions for the agentic AI era
villagesql.com · 2d · Discuss: Hacker News
🤖 AI
Self-Referential Quantum Barriers for AGI Containment
redact-app.com · 1d · Discuss: Hacker News
🤖 AI
AI agent sandboxing in 2026: how to choose between primitives, runtimes, and platforms
manveerc.substack.com · 1d · Discuss: Substack
🔥 PyTorch
Multi-Dimensional Computational Library for Physics-Aware AI
splitfxm.com · 2d · Discuss: Hacker News
🔧 FEA
Overview of end-to-end encrypted AI inference for Confer
news.ycombinator.com · 1d · Discuss: Hacker News
🤖 AI